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1. Introduction 

In this paper we will discuss a number of loosely related questions, which em- 
anate from Thurston's geometrization program for three-dimensional manifolds, 
and the general Thurston "yoga" that most everything is hyperbolic. We venture 
quite far afield from three-dimensional geometry and topology - to the geometry 
of higher rank symmetric spaces, to number theory, and probability theory, and to 
the theory of finite groups. In Section [2] we describe the underpinnings from the 
theory of three-dimensional manifolds as envisaged by W. Thurston. In Section 
[3] we will describe one natural approach to describing randomness in groups. In 
Section [5] we describe an approach to actually producing random matrices in lat- 
tices in semisimple Lie groups using the philosophy in Section [3] In Section [6] we 
describe a different approach to randomness, and the questions it raises. 

2. Thurston geometry 

I As far as this paper is concerned, history begins with Bill Thurston's geometriza- 
tion program of three-dimensional manifolds. We will begin with the fibered ver- 
sion The setup is as follows: we have a surface M (a two-dimensional manifold, 
homeomorphic to a compact surface with a finite number of punctures) and a 
homeomorphism <p : M — > M. Given this information we construct the mapping 
torus T<p(M) of <p, by first constructing the product n = M x [0, 1], and then defining 
T(b(M) to be the quotient space of n by the equivalence relation which is trivial out- 
side M x {0, 1}, where (x, 0) ~ ((ft(x), 1). One of Thurston's early achievements was 
the complete understanding of geometric structures on such fibered manifolds. 
To state the next results we will need to give a very short introduction to the map- 
ping class group Mod(M), which is the group of homeomorphisms of our surface 
modulo the normal subgroup of homeomorphism isotopic to the identity - for a 
longer introduction, see the recently published (but already standard) reference 
[19J. In low genus, the mapping class group is easy to understand. For M S 2 , 
| Mod(M)| = 2; every automorphism of the sphere is isotopic to either the identity 
map or the antipodal map. The next easiest case is that of the torus: M - T 2 . Then, 
Mod(M) - GL(2, Z). Looking at this case in more detail, we note that the elements 
of GL(2, Z) fall into three classes: elliptic (those with a fixed point in the upper 
halfplane), parabolic (those with a single fixed point p/q on the real axis in C) and 
the rest (these are hyperbolic, and have two quadratic irrational fixed points on 
the real axis). Elliptic elements are periodic. Parabolic elements leave the (p, q) 
curve on the torus invariant (they correspond to a Dehn twist about this curve). 
Hyperbolic elements leave no curve invariant. Further, one of their fixed points is 
attracting, while the other one is repelling. These two fixed points correspond to 
two orthogonal curves of irrational slope on the torus. 
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In the case where M is the torus with one puncture, Nielsen had proved that 
ModM is the same as for M - T 2 . After that, things were rather mysterious, 
until Thurston discovered his classification of surface homeomorphisms, which 
parallels closely the toral characterization. Thurston's result is that every surface 
homemorphism falls into three classes: it is either periodic, or leaves invariant a 
multicurve y (a collection of simple closed curves on M) - in this case the map is 
allowed to permute the components of y, or pseudo-anosov, in which case the map 
has a pair of orthogonal measured foliations, one of which is expanded by cp and 
the other is contracted. This is a highly non-trivial result which is the beginning of 
the modern two-dimensional geometry, topology, and dynamics. For a discussion 
in considerably more depth, see the standard references l|70l[ll[T2l l2T| . The next 
theorem ties the above discussion into Thurston's geometrization program for 3- 
dimensional manifolds (the special case of fibered manifolds was probably the first 
case of geometrization finished - see J. P. Otal's excellent exposition in ||54| . For 
an in-depth discussion of the various geometries of three-dimensional manifolds, 
see G. P. Scott's paper [165]. 

Theorem 2.1 (Thurston's geometrization theorem for fibered manifolds). Let T^M) 
be as above. Then we have the following possibilities for the geometry ofM. 

(1) IfM a* S 2 , then T^M) is modeled on S 2 x R. 

(2) IfM T 2 , then we have the following possibilities: 

(a) Ifcp is elliptic, then T^(M) is modeled on E 3 . 

(b) Ifcp is parabolic, then T<p{M) is a nil-manifold. 

(c) If (p is hyperbolic, then T^(M) is a solv-manifold. 

(3) IfM is a hyperbolic surface, then 

(a) Ifcp is periodic, then T^(M) is modeled on H 2 x 1R. 

(b) Ifcp is reducible, then TAM) is a graph-manifold. 

(c) Ifcp is pseudo-Anosov, then T^M) is hyperbolic. 

An attentive reader will note there are seven special cases, and six out of the eight 
three-manifold geometries make an appearance. Six out of the seven special cases 
of the theorem are easy, while the proof of the last case|3c]occupies most of the book 
|54J. Thurston's philosophy, moreover, is that "most" fibered (or otherwise) three- 
manifolds are hyperbolic - the first appearance of this phenomenon in Thurston's 
work is probably the Dehn Surgery Theorem (|7T|), which states that moth Dehn 
fillings on a cusped hyperbolic manifold yield hyperbolic manifolds), and the last 
appears in his joint work with Nathan Dunfield Ifl6l H5l , where it is conjectured 
that a random three manifold of fixed Heegard genus is hyperbolic. The actual 
statement that a random fibered manifold is hyperbolic seems to have not been 
published by Thurston, and the honor of first publication of an equivalent question 
goes to Benson Farb in 1120 II . We will discuss Farb's precise question below, but 
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first, let's talk about what it means for some property P to be generic for some 
(possibly) infinite (but countable) set S. 



3. An idealist approach to randomness 

First, define a measure of size v on the elements of S. This should satisfy some 
simple axioms, such as: 

(1) v(x) > for all x e S. 

(2) The set Sjt = {x e S \v(x) < k) is finite for every k. 

Let now P be a predicate on the elements of S - think of a predicate as just a 
function from S to {0, 1}. Let P c S be defined as f = {x £ S |P(x) = 1 }, and define 
Pk = {x eP \v(x) <k). We say that the property P is generic for S with respect to the 
valuation v if 



(1) 



km — 



1. 



We say that P is negligible with respect to i> if 
(2) 



lim 



0. 



Sometimes the above two definitions are not enough, and we say that P has 
asymptotic density p with respect to v if 



(3) 



km {^} 



= p. 



These definitions work well when they work. Here are some examples: 

Example 3.1. The set S is the set N of natural numbers, and the predicate P is 
P(x) = is x prime?. The valuation v is just the usual "Archimedean" valuation on 
IN, and, as is well-known, the set of primes is negligible. One can make a more 
precise statement (which is the content of the Prime Number Theorem, see [13 , 50J): 
With definitions as above, 

Example 3.2. Let S be the set of integer lattice points (x, y) £ Z 2 , let Q c E 2 be a 
Jordan domain, and define the valuation on S as follows: 



v(x) = infjf \x G fQ}. 
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Further, define the predicate P by P(x, y) = x is relatively prime to y. such 

points are called visible, since one can see them from the origin (0,0). Then, the 
asymptotic density of P is ^ = Jj. 

The proof of this for Q being the unit square is classical, and can be found (for 
example) in Hardy and Wright ([27|) or in the less classical reference [63J. To 
get the general statement, we first note that the special linear group SL(2, Z) acts 
ergodically on the plane R 2 (see ||76| ). Now, define a measure y. t by 
1 

/^(Q) = ^the number of points such that P(x, y) = 1 in fQ. 

Each /.tf is clearly a measure, dominated by by the Lebesgue measure, and invariant 
under the SL(2, Z) action on R 2 . By Helly's theorem 11351 Section 10.3] It follows 
that the set {mu t } has a convergent subsequence a, and by SL(2, Z) invariance, the 
limit y. a is a constant multiple of the Lebesgue measure, and the constant can be 
evaluated for some specific Q, such as the square (more details of the argument 
can be found in ||3"2~10 . Notice that the constant does not depend on a, so all the 
convergent subsequences of the set {y. t ) have the same limit, which must, therefore, 
be the unique limit point of the set. 

Example 3.3. Consider the free group on two generators F 2 = {a, b) We define the 
valuation v(x) to be the reduced word length of x. Let P be the predicate: P(x) = 
the abelianization a(x) e Z 2 is a visible point. Then P does not have an asymptotic 
density. It does, however, have a an asymptotic annular density, defined as follows: 
Let X c S, where S, as usual, has a valuation satisfying our axioms. We define 
Sk = {x G S \v(x) = k}, and similarly for X^. Then, the k-th annular density of T is 
defined by 

(4) pk(X) = - 1— + — 

Z \ bk-i bk 

We define the strict annular density of X to be p^(X) = lim^oo Pfc(X), if the limit 
exists. The general result (shown in |32J) is: 

Theorem 3.4. Let S be an SL(n,Z) invariant subset ofZ k , and let S = fl _1 S, where a, as 
before, is the abelianization map from the free group on k generators F k to 1}. Then S has a 
strict annular density whenver S has an asymptotic density. Moreover, the two densities 
are equal. 



The proof of Theorem 3.4 uses the ergodicity of the SL(n, Z) action on R", as 
described in Example 3.2 and the central and local limit theorems of [58J (see also 
El) and of 11661 . 

The examples above show that the cases where the groups are reasonably simple 
to describe, the idealistic valuation-based approach is quite successful. However, 
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once the groups are more complicated, this approach often bogs down in at least 
some ways, the principal of which is that when one talks of negligibility, genericity, 
or density, one is making a statement about properties of random elements of the 
set S. However, this raises the question of how to generate such random elements 
(note that the generation method will often hold the keys to our ability to approach 
asymptotic statements. 

Example 3.5. Let S = SL(n, Z). There is a natural family of valuations on S - the 
archimedean valuations associated to the various Banach space norms on the space 
of matricesMn x n. Since all these are known to be equivalent, we might as well 
choose the frobenius norm (the L 2 norm of a matrix x £ SL(n, Z) viewed as a vector 
in Z" x ".). In other words, in our previous language, 



v(x) 



n n 



\ i=i 



a 2 

;'=i 



Let S<k be the set of those x in S with v(x) < k. It is not at all obvious how to find 
the cardinality of S<k, though this has been done (relatively recently) for SL(2, Z) 
(by Morris Newman in B52B - see more on this in Section |4j) and in general by W. 
Duke, Z. Rudnick, and P. Sarnak in (111, A. Eskin and C. McMullen in (T7|. The 

2 

result is that the number of points is asymptotic to a constant times k n ~ n - the 
constant for n = 2 is 6. In any case, enumeration in and of itself is difficult, and 
enumerating subsets seems more difficult still. An example of this is the (simple to 
state) question of finding a uniformly distributed random element of bounded norm 
(see section [5] for more). 

Nevertheless, one can try to show some results on linear groups using the the 
Archimedean valuation as above, sometimes using the very deep results of P. 



Sarnak and A. Nevo as 114911 , which are a major advance on the results of |T4]|. 

Example 3.6. We are using [|59ll60| I show that a generic element of SL(n,Z) has 
irreducible characteristic polynomial; furthermore, the Galois group of the charac- 
teristic polynomial is generically the full symmetric group. In the same references 
I show that the generic element of Sp(n, Z) has irreducible characteristic polyno- 
mial. 

Example 3.7. Let G be a lattice in a semi-simple linear group ©, pick a (non-central) 
element /z £ G and for an element x £ G consider the group H x = (x,h) .Consider the 
predicate P(x) = H x is Zariski dense in ©. In the paper fl64l I show that P is generic 
in G. Similarly, if we H = G x G, then the property P(x, y) = H x is Zariski dense in © 
is generic in G x GThe results are based on strong approximation theory, as devel- 
oped in [46 J; see also 1156 II . 
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Question 3.8. How do we tell if a subgroup G of (say) SL(n, Z) given by its matrix 
generators is Zariski dense? 



There are two different avenues by which to attack Question 3.8 The first is via 
strong approximation techniques: it is known (by [46]) that almost all modular 
projections are surjective for a Zariski dense subgroup G, and further, by the results 
of Weigel |f74f, the group is Zariski dense if any projection modulo some p > 3 is 
surjective. So, we need only check a finite number of possible bad projections, 
which is bounded by the work of Rapinchuk 11571 , but the bound is not what one 
would call practical, so this approach, while aesthetically pleasing, takes a lot of 
work to make work. 

A completely different approach is the brute force attack: take the group, com- 
pute several elements, compute their (matrix) logarithm, and see if the resulting 
elements generate the Lie algebra of SL(n, C) as a vector space. This is a much 
more computationally promising approach, but it requires a lot of work to pro- 
duce provable results (the logarithm can usually be computed only approximately, 
so one needs to find the measure of one's confidence in one's results, etc). The 
method is particularly effective when there are a lot of unipotent elements in the 
subgroup, since the logarithm of a unipotent integral matrix is a matrix with ra- 
tional entries, so no approximation techniques are necessary (this was pointed out 
by to the author by A. Eskin). 

Example 3.9. Jointly with Elena Fuchs ||23l we show if G is a lattice in SL(2, C), and 
H = G x G, then, for any a > 0, the property 

P(x, y) = The Hausdorff dimension of (x, y) is at least a 

is negligible in H. 

The argument uses the ergodicity of the action of SL(2, Z) on the plane (which 
shows that the attractive and repelling fixed points of elements are equidis- 
tributed), and a ping-pong argument, together with bounds on the Hausdorff 
dimension as in 071 . 



Question 3.10. A closely related question to the one discussed in Example 3.9| is 
open: Consider a pair of elements one of which is parabolic. Is it true that the 
property 

P(x, y) = The Hausdorff dimension of (x, y) is at least a 

is negligible for any a > 1 /2? 

It should be remarked that it is a theorem of Beardon that any such group 
does have Hausdorff dimension at least 1/2, so the constant 1/2 in Question 3.10 is 
best possible. 
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It is hoped that the techniques used to attack Example 3.9 can be extended to 
attack Question |3.10| 

Example 3.11. Jointly with Inna Capdeboscq ||TTf we show that if G is a lattice in 
a semi-simple Lie group © of rank at least two, and H = G x G, then if we define 
P(x, y) = (x, y) is profinitely dense, then P(x, y) has asymptotic density bounded 
below. 

The idea of the argument is as follows: 

First observation is that SL(n, Z/NMZ) = SL(n, Z/NZ) x SL(n, Z/MZ), for N,M 
relatively prime. 

Second observation is that random elements (either in the random walk model 
or in the "archimedean height" model) are eventually equidistributed in modular 
projections, e.g modulo the product P of the first k primes (this is one of the 
results of ||59|V By the first observation, the behaviors modulo different primes are 
independent, and so by II3TII3611 the probability that the projections onto the first k 
primes are surjective is bounded below by 



B k = \[{1-C{n)lf- 1 ), 



where C(n) is their rank-dependent constant. Since the series 

y — 

Z—l v n-l 
i=l r 

converges for n > 2, it follows that the products B k converge to some constant B. 
Now, for any e we can pick k in such a way that |Bjt — B\ < e, while 



R k = > — f < e. 



By the union bound, the probability that some projection is not surjective is 
bounded above by (1 - B k ) + R k < B + 2e. So, as long as e <c (1 - B)/2, we get 
the probability that at least one projection does not surject is bounded above by 
(1 - B)/2. In reality, of course, if the walks get very very long, the true probability 
of profinite density is bounded below by B. 

The observations above show that the modular projections are surjective for 
all prime moduli with positive probability. Then, using some group-theoretic 
arguments we can show that there is a positive probability of surjection for all 
moduli. Then probabilities one gets are completely effective. 
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Since the first example of a profinitely dense (free) subgroup of SL(n, Z) was 
constructed by Steve Humphries in his beautiful paper II281 , we call the groups 



described in Example|3.11 Humphries groups. Now for the question: 



Question 3.12. Suppose we are given a subgroup G of SL(n,Z) given by matrix 
generators. How hard it is to decide whether it is a Humphries group? 



Question 3.12 is quite difficult. Notice that the full lattice SL(n, Z) is a Humphries 
group (by the definition above), so there is a natural dichotomy: Either G is the 
whole SL(n, Z) or, by the congruence subgroup property, it is infinite index, and 
further, by ||42]| (or the sharper and more general EH)/ G is Zariski-dense in 
SL(n, Z). This dichotomy is not really relevant from the algorithmic standpoint 
- we know from the work of Matthews, Vaserstein, Weisfeiler [|46| that for any 
Zariski-dense subgroup G only a finite number of projections is not surjective, 
and if we could bound the number, we could just check surjectivity for every 
possible exceptional modulus - such a check can be performed efficiently using, 
for example, the algorithm of Neumann and Praeger ||48l . Unfortunately, getting 
a bound using the generators is not so easy. The first advance came (after the 
author raised the question at an MSRI Hot Topics conference) very recently, in the 
work by A. Rapinchuk [57|, but the bounds there, while explicit, are not really 
computationally useful, as Rapinchuk prominently states in the paper. 

On the toher hand, the beginning of the discussion in the paragraph above begs 
the question: 

Question 3.13. Given a collection of matrices in SL(n, Z) do they generate SL(n, Z)? 

There appears to be only one practical approach: that is, compute the funda- 
mental domain of the span of the matrices on the homogeneous space of SL(n, IR). 
If we are lucky, and that domain is finite-volume, we can answer the question (the 
author has conducted a number of experiments along these lines). No general 
attack seems to be available, and it is not even clear whether the question is decid- 
able! Similar sounding questions (like the membership problem) are undecidable 
in SL(n, Z) for n > 4, but the techniques seem unapplicable here. In special cases 
(which are central to the study of mirror symmetry, see 116711810 the question can be 
decided by a rather diverse set of approaches. 

4. Punctured (or not) torus 

Consider the modular group SL(2, Z). Our first set of results will use ordering 
by Frobenius norm of the matrix. 

Definition 4.1. The frobenius norm of the matrix x = f ^ is \\x\\ = Vfl 2 + b 2 + c 2 + d 2 . 
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The first question is: 

Question 4.2. How many elements x e SL(2,Z) have ||x|| < N? 

It is surprising that this question was first answered by Morris Newman in 
1988(!) El: 

Theorem 4.3. The number A4 of elements x e SL(2,Z) with \\x\\ < k is asymptotic to 
6k 2 . 



M. Newman's proof of Theorem 4.3 begins by reparametrizing SL(2), as follows. 
First define the following variables: 

A = a + d 
B = b + c 
C = b-c 
D = a - d 

We see that A 2 + B 2 + C 2 + D 2 = a 2 + b 2 + c 2 + d 2 . Further note that 
(5) 4 = 4(ad - be) = A 2 + C 2 -B 2 - D 2 , 

while 



(6) A = tr 



a b] 
c d) 



Since the difference between A and D is 2d, we know that 

(7) A = D mod 2, 
and for the same reason 

(8) B = C mod 2. 

Then Newman writes down a generating function for the number of matrices in 
SL(2, Z) with prescribed Frobenius norm in terms of the theta function 



e(x) = £ 



ti 

oox 



and uses classical estimates on the coefficients of products of theta functions to 
obtain the asymptotic result of Theorem 4.3 Since an exposition of this method 
would take us too far afield,let's use the parametrization above to count those 
elements with trace equal to 2 (which is to say, the parabolic elements). Equations 
Q and ([6]) tell us that the number of such matrices with Frobenius norm bounded 
by k is exactly equal to the number of Pythagorean triples of norm bounded by k. 
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Now, as is well-known, pythagorean triples (A,B,C) with A 2 = B 2 + C 2 are 
rationally parametrized by 

(9) A = u 2 + v 2 

(10) B = u 2 -v 2 

(11) C = 2uv, 

With this parametrization, the 2-norm of (A, B, C) equals V2(u 2 + v 2 ), so the number 
of pythagorean triples with L 2 norm bounded above by X equals the number of 
pairs (u, v) with L 2 norm bounded above by 2 1 / 4 Vx, which, in turn, is asymptotic 
to V2~7iX. Note that the congruences ([7]) and ^ tell us that 2uv = a - d. This 
overcounts by a factor of two (since (u, v) and (—u, -v) give the same Pythagorean 
triple, but on the other hand, parabolic matrices are allowed to have trace equal to 
±2, so when the smoke clears, we have: 

Theorem 4.4. The number of parabolic matrices in SL(2, Z) and Frobenius norm bounded 
above by k is asymptotic to ^flnk. 

Now, we make the following observation: 

Observation 4.5. The characteristic polynomial of a matrix in SL(2, Z) factors over 
Z if and only if the matrix is parabolic (so has trace ±2.) 

Proof. Indeed, if M e SL(2, Z), the roots of the characteristic polynomial x(M) are 

trM± Vtr 2 M-4 
2 ' 

For x(M) to factor, tr 2 M - 4 must be a perfect square, which obviously happens 
only when | trM| = 2. □ 

We thus have the following: 

Theorem 4.6. The probability of a matrix in SL(2,Z) of Frobenius norm bounded above 

by x to have reducible characteristic polynomial is asymptotic to — . 

5. Looking for random integer matrices 

Consider the following simple question: Let v be an Archimedean valuation in a 
lattice (below we will be discussing SL(n, Z), but the question is just as interesting 
for any other lattice in a non-compact Lie Group (not necessarily semi-simple). 

Question 5.1. Given k, how do we choose a random element x uniformly with 
v(x) <k? 



Even for SL(2, Z), Question 5.1 seems completely open, but here is an idea: 
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5.1. A line of attack for SL(2, Z). To get an approximately uniform element, based 
on the fact that the homogeneous space of SL(2, IR) is the hyperbolic plane H 2 : 
Take a basepoint in H 2 (since we will be eventually interested in SL(2, Z), the 
Poincare halfspace model is popular, and there the point i = V-T is a popular 
choice of basepoint). Now, the matrices in SL(2, IR) with Frobenius norm bounded 
by N translate i by hyperbolic distance at most some f(N), so pick a disk D of 
radius g(N) in H 2 , and pick a point x uniformly at random, x will lie in some 
fundamental domain of the SL(2, Z) action. The point x corresponds to a lattice 
L c IR 2 , which can be reduced (using Legendre' algorithm - basically continued 
fractions) - this corresponds to finding a matrix m(x) in SL(2, Z) which maps the 
fundamental domain of x to the "standard" fundamental domain of the modular 
group SL(2, Z). The matrix m(x) is our candidate for the uniformly random element 
of SL(2, Z) we seek (the fact that equidistribution on H 2 leads to equidistribution 
on SL(2, IR) is standard, see for example |fl7|). 

There are two problems with the above approach. Firstly, m(x) might not satisfy 
the norm constraint. If that is the case, we throw it away, and try again - if g(N) is 
not too big, this process will terminate reasonably quickly. The other problem 
is that the area in the disk D is only approximately equidistributed amongst 
fundamental domains (more precisely, the intersection of D with the union of 
the translates of the basic fundamental domain by matrices satisfying the norm 
inequality is only approximately equidistributed among these translates). There 
is a trade-off between the two problems: the larger disk we take, the more uniform 
the distribution is, but the less likely we are to get a point satisfying the norm 
inequality, so some care is required in designing this algorithm properly. 

5.2. A line of attack for SL(n, Z) and Sp(2n, Z). The method described in Section 
|5.1| can be extended to higher dimensions, for at least the special linear and sym- 
plectic groups. The homogeneous space for SL(n,Z) (symmetric positive definite 
matrices with determinant 1, and for Sp(2n, Z) (the Siegel halfspace) have been 
known for several decades, and sampling uniformly from the ball in that space 
is easy, using what Lie Theorists call the KAK decomposition, and most other 
people call the singular value decomposition. The Haar measure (induced by that 
on the Lie group) is easy to compute, and a random element in the ball of the 
homogeneous space is easy to sample. 

What is not so easy is the lattice reduction step. Lattice reduction is a much 
studied problem, since the groundbreaking work of Lovasz (as embodied in the 
LLL) algorithm, a good survey is B53l (interestingly, the symplectic version of 
the problem had not been considered until quite recently, see [24]). The problem 
is that the complexity of exact lattice reduction is, at present, exponential (in 
dimension n of the lattice. The LLL algorithms, and the various improvements run 
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in polynomial time, but they don't necessarily get us to the canonical fundamental 
domain. They do get us near the canonical fundamental domain, which brings up 
a fundamental question: 

Question 5.2. What are the statistical properties of the currently used lattice reduc- 
tion algorithms. 

In other words, if we run the algorithm we sketched not with a precise lattice 
reducer, but with an approximate one (like LLL), will the matrices we get be 
uniformly distributed in the ball in SL(n, Z) or Sp(2n, Z)? 

The basic principle of the method described works for lattices over number field, 
and not just over Z. In fact, a version of lattice reduction for such is described in 
[22 J. The version over SL(2,Z) can be easily made to work to generate a random 
matrix in an arithmetic Kleinian group - the continued fraction algorithm analogue 
is described in |55J. It would be interesting to analyse the non-arithmetic case. 

6. A NON-IDEALIST APPROACH TO RANDOMNESS 

A much more tractable, from the computational standpoint, approach to gener- 
ating random elements of fairly arbitrary (finitely generated) groups is the follow- 
ing: 

Take a symmetric generating set S = {g\, ...,£>} of our favorite group G, (where 
"symmetric" means that the set is invariant under the map x i-> x" 1 , and look at 
the set Wk words of length k in the elements of S. The statement that the group G 
is finitely generated means that 

\Jw k = G. 

k>0 

Now, the trick is to use the definitions in Section |3j but apply them not to the group 
G itself but to the free monoid Ms on S, with 

v(x) = the word length of x. 

This approach has many advantages: it is trivial to produce a random element (just 
multiply elements at random), there are different techniques for proving results, 
and, in the linear group context this model is closely related to the study of random 
matrix products, which has a long history and a considerable record of success (a 
standard reference is 0)- The (fairly obvious) disadvantage is that the structure 
of Ms has nothing to do with the structure of G, and it is very difficult to estimate 
the relationship between the number of occurences of a given group element in 
a u-ball of some given radius R. Consequently, doing probability on M$ instead 
of G is has a certain air of capitulation to it. On the other hand, the free monoid 
model can be refined, as follows: Elements in the free monoid can be identified 



14 



IGOR RIVIN 



with walks in the complete graph 'Kk on k = \S\ vertices (we use the term "complete 
graph" in a somewhat nonstandard way: every vertex of is connected to itself 
in addition to all the other vertices (see Figure [SI. If we remove the requirement 




Figure 1. The recognizing automaton for the free monoid on two 
generators is the very complete graph; bB = aA = 1 

that a graph be the (very) complete graph, and, indeed, a directed, as opposed 
to undirected graph, we find ourselves in the world of regular languages, and it 
was a major discovery of Jim Cannon's, expanded upon by a number of people, 
including David Epstein and Bill Thurston (see the classic book fT0|) that such 
regular languages are a good way of describing a large class of groups (the so- 
called automatic groups). For such groups, the length of a walk in the defining 
automaton is a very good valuation - in particular, it coinsides with the distance 
in the Cayley graph from the identity element). 

However, it turns out that this is too broad a context to be able to demonstate 
sharp results, and so much of the author's work (see [59l|60H64l so far has centered 
on a smaller set of automatic structures: namely, we consider only undirected 
graphs, which, in addition, have the Perron-Frobenius property: there is a unique 
eigenvalue of the adjacency matrix of maximal modulus. This has the immediate 
benefit of bringing the free monoid model closer to reproducing structures of actual 
interest. For example, here is the graph which generates reduced words (for two 
generator groups, the general case is similar): This brings up the relatively obvious 
question: 

Question 6.1. Which groups have an automatic structure where the accepting au- 
tomaton has the properties of being 
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(1) undirected. 

(2) Perron-Frobenius? 

Let's call the combination of properties 1 and 2 property R. 
First, a conjecture: 

Conjecture 6.2. Every word hyperbolic group has property R, with respect to some 
generating set. 

It is not clear that property R is generating set invariant. While the answer to 



Question 6.1 is obvious of interest, and the only groups known to have property R 
are free groups, there is a "cheap" way to extend the techniques to a bigger class 
of groups, as described in the next section. 

6.1. What if your group is not free? As a simple example of a non-free group, we 
take the modular group Ai = SL(2,Z). As is well known, this group is almost, but 
not quite free. More precisely, 

M = (s,r|s 2 ,r 3 ) = c 2 *c 2 , 

where C p is the cyclic group of order p and ★ denotes the free product. The obvious 
symmetric automaton which accepts C p ★ C q has p + q - 2 vertices, corresponding 
to T,T 2 ,. .., T v ~ l , S, . . . , S q ~ l . Every vertex corresponding to V is connected to all 
vertices of the form SK This works wonderfully, except for the minor matter of not 
representing the identity element and the not-so-minor matter of being bipartite, 




Figure 2. The recognizing automaton for the free group on two gen- 
erators F 2 .;bB = a A = 1 
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hence not having property R. However, this can seemingly be fixed by making a 
new graph with (p - l)(q - 1) vertices (corresponding to the products S'Ti) and q - 1 
start states (corresponding to T, . . . , q - 1. 1 believe that this technique will allow 
the methods used in Il59~ll50~ll64l to be extended to this class of groups. 



7. A NON-IDEALISTIC APPROACH TO SL(2, Z) 

Using the automata described in either of the Figures [6] or [6 



we can study 

SL(2,Z) using the random walk approach described in Section 6} The idea is 
simple: consider an n-step walk on the recognizing graph G. Since the group 
SL(2, Z) is a bit too big for us (it is infinite, for one thing), let's do a quick warm- 
up, and consider the group Ai p = SL(2,Z/pZ). How many elements of Ai p have 

trace equal to 2? A matrix M e M has trace 2 if it has the form M = \ J°_ ) , 

where a(2 - a) - be = 1. The last equation can be rewritten as 

(12) be + (a - l) 2 = 0. 

Now, if a = 1, Eq.([T2]> has 2p - 1 solutions (p - 1 solutions with b, but not c equal to 
0, p - 1 solutions with c, but not b, equal to zero, and (0,0).) If a £ 1, Eq. ( [T2) > has 
p - 1 solutions of the form (b, -(a-l) 2 /b). This gives a total of 2p - 1 + (p - l) 2 = p 2 
matrices with trace equal to 2. On the other hand, the order of Ai p is equal to 
p(p 2 - 1), so for large p there is a probability of around 1 /p that M picked uniformly 
from Aip picked uniformly at random has trace equal to 2. 

What does this have to do with the problem at hand? Note that a matrix which 
has trace equal to 2 has trace equal to 2 for every prime p. This means that if 
the walks on our graph G are equidistributed in Ai p for some p, the asymptotic 
probability that an element is parabolic is at most 1 /p. The key is now the following 
theorem of [59 J: 

Theorem 7.1. Let G be a graph with property R, with vertices labeled by generators 
y\,...,y v of a finite group T. Then, the walks of length k become equidistributed in T, 
exponentially quickly as a function ofk, unless all of the are sent to the same complex 
number by some one-dimensional irreducible representation ofT. 

Since Ai^ has no irreducible one-dimensional representations, we see that the 

asymptotic probability is smaller than any 1/p, and hence the asymptotic density 
is 0. In fact, using the fact that SL(2,Z), while not enjoying property T, does 
enjoy property t for congruence representations (see EH), we can show that the 
probability of being parabolic decreases exponentially in the length of the walk (see 

ini). 
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It should be remarked that for specific generating sets, explicit growth rates 
have been computed in GH68]|. In the (chronologically) the first of these papers 
(|68J) Takasawa views SL(2,Z) as the mapping class group of the torus. In the 
second, Korkmaz and Atalan study the mapping class group of the four-punctured 
sphere, but the two objects are, in fact, the same (though the generating sets are 
different). This follows from the fact that for any hyperbolic structure on the 
punctured torus there is the elliptic involution, the quotient by which is an orbifold 
of signature (0; 2, 2, 2, oo), while each quadruply punctured sphere admits an order 
four symmetry group (the Klein four-group) of involutions (this can be seen in 
many ways, one of which being that each complete finite-area structure on the 
four times punctured sphere can be realized uniquely as the induced metric on an 
ideal symplex in H 3 , see II62ID , the quotient by which is the self-same orbifold. The 
reader wishing a much harder algebraic proof of this fact can consult |69J. 

7.1. Polynomial versus exponential. The attentive and inquisitive reader may 
have noticed that in Section [4] we showed that for the matrices of norm bounded 
above by x, there was a probability of order 1/x of finding a parabolic, while this 
probability decreases exponentially fast for the graph walk model. The simplest 
way of explaining this is the following: the number of distinct elements of length 
k grows exponentially in k (the order of growth depends on the generating set), so 
the probability of being parabolic is decaying only polynomially in the size of the 
sample space. 

The second simplest explanation (closely related to the first) is that at least 
in the simplest possible walk model, the expected norm of the products grows 
exponentially in the length of the walk (this follows from the classical theory of 
random matrix products, see 0. 

The case of SL(2, Z) is particularly interesting (as it always is). A particularly 
popular generating set for SL(2, Z) is the set {L, U} where 



l= ; : \, u 





Any matrixM = ^ j , where we assume that b > a > 0, d > c > (the other cases 

are similar) can be written as M = U a °L ai . . . U a ' , where b/d = [a , r \. In this 

case, the word length of M in terms of the generators L, U is simply the sum of 
the continued fraction coefficients of b/d - the word length with respect to other 
generating sets is within a multiple (obtained by writing the "new" generators S, T 
as words in U, L and vice versa). It turns out that the sum of continued fraction 
coefficients is not so easy to analyze, though not for lack of trying. The first 
reference seems to be the paper of Andy Yao and Don Knuth in 1975 [75J, where 
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the authors show that the average value of the sum S(p/q) of the continued fraction 
coefficients of p/q taken over all < p < q satisfies: 



- Yj S(p,q)~^(logq) 2 



' 0<p<q 

The (log q) 2 growth is a little deceptive, since the distribution has "fat tails. " Indeed, 
I. Vardi had shown in [72] that for any a satisfying 1 > a > \, we have 

S(p/q) < (log^) 1+a , p < q <n, with at most £ (N 2 (log N) 1/2 ~ a+e ) exceptions. 

Vardi's paper is mostly concerned with Dedekind sums, which can be interpreted 
as the alternating sum of continued fraction coefficients. The reader is referred 
to Vardi's very nice preprint 117311 for more on continued fractions and related 
mathematics. 

8. Higher mapping class groups 

The techniques which work for the torus at first appear to fail resoundingly 
for the mapping class group of more complicated surfaces, since the mapping 
class group in that setting is not linear. Luckily, there is a workaround: The 
Torelli homomorphism T s a homomorphism from the mapping class group Ai(S) 
of a (closed, for simplicity, and oriented) surface S of genus g to Sp(2g,Z). A 
mapping class (p is mapped by T to its action on the first homology group of S. 
This action preserves the intersection pairing, and so the image of T is contained 
in the symplectic group. In addition, the following fact is standard: 



Fact 8.1 (see EL]). The image ofT is all of the symplectic group Sp(2g,Z). 

Except for the cases of the torus and the four-times punctured sphere, the map T 
has a nontrivial kernel, known as the Torelli group X(S) of the surface S. The Torelli 
group contains pseudo-anosov elements, so the Torelli homomorphism does lose 
a lot of information. Nonetheless, there is the following theorem due to A. Casson 
(see (HI): 

Theorem 8.2. Suppose the matrix M = T((p) has the following properties: 

(1) The characteristic polynomial x(M) is irreducible. 

(2) The characteristic polynomial x(M) is not cyclotomic. 

(3) The characteristic polynomial x(M) is not of the form f(x k ),for some k > 1. 
Then (p is pseudo-Anosov. 

In view of Fact 8.1 the question of showing that a generic element of At(S) is 
pseud- Anosov reduces to snowing that a generic (in the sense of Sectio n[6|) e lement 



of the symplectic group Sp(2g, Z) satisfies the conditions of Theorem 8.2 This is 
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a done by what is, philosophically (in a sense that was later made precise by E. 
Kowalski in Il34l0 , a sieving argument. We show that the properties desired by 
Theorem 8.2 are enjoyed with a probability independent of the prime p by elements of 
the quotient group Sp(2g,Z/pZ). Since, by strong approximation (see [56J, or, for 
a more elementary approach, in [51]), the reductions modulo different primes are 
independendent, and the properties described in Theorem 8^2] are assymetric, in 
the sense that, for example, in order to conclude that a polynomial is irreducible it 
is enough to find a single prime for which the reduction mod p is irreducible^ brings 
us close to the end. The end is achieved thanks to the fundamental equidistribution 
result Theorem 7.1| together with property T for the groups we are studying for 
the groups we are interested in to assure exponential convergence (fl60lQ. 



8.1. The good news. The argument sketched above (see ||59ll60| for all the details) 
has many virtues. Firstly, it is very general (just how general was outlined by 
Lubotzky and Meiri in |4D|V 

In particular, it can be used to show that a generic element in the outer auto- 
morphism group of a free group is irreducible with irreducible powers, which is the 
analogue in that setting of being pseudo-Anosov (an element ip of the automor- 
phism group of a free group is irreducible if it does not preserve any splitting of 
the free group F as a free product F = G ★ H; it is irreducible with irreducible 
powers (iwip) if all powers ip k are, likewise, irreducible). The importance of iwip 
automorphisms was first noted in the foundational paper of M. Bestvina and M. 
Handel |5]. For automorphisms of free groups there is the analogue of the Torelli 
homomorphism, which sends an automorphism \p of a free group F„ to its action 
on the abelianization Z" of F n . It is easy to show that the Torelli homomorphism % 
is surjective onto the automorphism group of Z" - GL(n, Z), and it is easy to see 
that an automorphism xp is irreducible if (of course, not only if) the characteristic 
polynomial of is irreducible. Initially, it seems a little frightening to check 
that \p is iwip by checking the characterstic polynomials of (%(ip)) k for every k for 
irreducibility, but it turns out (see ||59|) that it is enough to check that the Galois 
group of x(X(i/^)) is the full symmetric group. This can be proved by combining the 
previous ideas with the idea (going back to van der Waerden) of characterizing Ga- 
lois groups via the factorization patterns of polynomials modulo various primes. 
This result was later extended in 11301 to show that characteristic polynomials of 



Note that the argument we used for SL(2,Z) is much simpler, since there only one (large) 
prime sufficed, and no strong approximation argument was necessary - it turns out that this kind 
of argument works for SL(n, Z) to show that the characteristic polynomial of a generic matrix is 
irreducible, since it can be shown that the set of irreducible polynomials is a relatively sparse set 
of subvarieties of the set of coefficients 
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matrices in lattices in semisimple Lie group usually have as big a Galois group as 
possible (which is to say, the Weyl group of the ambient Lie group). 

The results are effective (and explicit) n that they give an exponential rate of 
convergence of the densities to 0. 

The results can be extended without any work to finite index subgroups of 
SL(n, Z) and Sp(2g, Z), and their preimages in the mapping class groups, and, 
with some work,and much use of the results of 0, to thin Zariski dense subgroups of 
such groups, and their preimages (these results are still effective, but considerably 
less explicit than for lattices. It can also be shown, using the results of [64J that 
for a generic subgroup H of the mapping class group, a generic element of H is 
pseudo-Anosov. 

However. 

8.2. Bad news. In the context of mapping class groups, our results are not useful 
for groups which have very small image under the Torelli homomorphism. In 
particular, the Torelli group itself is completely "orphaned" - Theorem 8.2 is 
vacuous for elements in the Torelli group. This is all the more galling, since more 
geometric approaches (see Section [9]) show that a generic element of a subgroup 
of the mapping class group which contains at least two non-commuting pseudo- 
Anosov elements (below we will call such subgroups nonelementary) is pseudo- 
Anosov. The problem with these approaches is that the convergence rates are 
completely ineffective, and also they do not apply in the less-geometric situations 
like the automorphism group of a free group. 

8.3. Better news. Recently, at least some of the news became less bad, since two 
groups: A. Lubotzky and C. Meiri ([39]) and J. Malestein and J. Souto (||44| have 
extended the results sketched above to the Torelli group, using very similar meth- 
ods. Lubotzky and Meiri have also extended their results to the Torelli subgroup 
of the automorphism group of the free group in (38|. (The results on the Torelli 
group make sense only when the genus of the surface in question is at least three 
- genericity is hard to define for infinitely generated groups). 

The first idea of these results goes to the beautiful paper of E. Looijenga 11371 . 
later developed in a more algebraic direction by F. Grunewald and A. Lubotzky 

my. 

Consider a surface S and a double cover S. Any homeomorphism the Torelli 
group lifts to a homeomorphism of S. In addition, the image of the lift of the Torelli 
group under the Torelli homomorphism of S is of finite index in PSp(2g - 2, Z), 
where P Sp(», Z) denotes Sp(«, Z)/{±7}. 

This gives us an indication that we might be able to use linear methods to study 
the Torelli group. The next ingredient goes back to the work of N. V. Ivanov [29J: 
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Theorem 8.3 (N. V. Ivanov). Any non-pseudo-Anosov element of the Torelli subgroup 
of a surface S leaves invariant an essential simple curve y on S. 

Note that a much stronger result was shown by B. Farb, C. Leininger, and D. 
Margalitin CH: 

Theorem 8.4 (|18J [Proposition 1.4). Let y be a curve and f an element in the Torelli 
subgroup. Then i(y,f(y)) > 4 if y is nonseparating, then i(y,f'{y)) - 2, for j = 1 or 

i = 2, 1 ' 

where i(x, y) denotes the geometric intersection number of curves x and y. 

Finally, it is noted that y can be used to construct a cover such that the element g 
in the corresponding? Sp(2g - 2, Z) leaves invariant a line in Z2g - 2, from which 
the genericity of pseudo-Anosovs in the whole Torelli follows. 

In fact, one can combine the above with the methods and results of [64] it can be 
shown that a for a generic subgroup of Torelli, a generic element is pseudo-Anosov. 
However, the silver bullet would be the following: 

Conjecture 8.5. For any nonelementary subgroup H of the mapping class group there is 
a cover S to which H lifts, and such that the image ofH under the Torelli homomorphism is 
not solvable, with the degree of the cover at most polynomial in the sums of the wordlengths 
of the generators ofH. 

It should be noted that we are very far from being able to resolve Conjecture 
8.5[ For example, while it is known that for every pseudo-Anosov mapping class 
\p there exists some cover to which \p lifts, and such that the \p is not in the Torelli 
subgroup for that cover (||33|), it is not known that the image ^(ip) is of infinite order! 
The degree of the cover is effective - the bounds in the paper follow essentially 
from the results of Edna Grossman (||2"5H - but, just as Grossman's paper, are easily 
doubly exponential in the word length of the element. 

9. The geometric approach 

Above we have alluded to the "geometric" approach to the mapping class, 
which uses the curve complex. This approach was used by Joseph Maher in l43l 
(where he also shows other remarkable results). Maher 's results are very general, 
but not effective. A somewhat more conceptual approach was undertaken in the 
very nice paper by A. Malyutin 11451 . Malyutin's approach is as follows: 

First, he uses the central limit theorem of M. Bjorklund and T. Hartnick[|6l (which 
is a vast generalization, at the cost of losing effectiveness completely of some of 
the results of the author's paper ||58~|) to show the following: 

Theorem 9.1 (A. Malyutin). Let G be a countable group and let O : G i-> R d is a 

nondegenerate ~R d -quasimorphism. Then, for each nondegenerate probability measure p. 
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on G and for every bounded subset Q c R d , there is a constant C = C(G,<&, p,Q) such 
that for any k e N and x e R rf we have 

p* k (o-^x + Q)) < Ck~ d/2 , 

where p* k denotes the k-fold convolution of p with itself. 

Recall that a quasimorphism from a group G to 1R is a map <p : G h-> 1R such that 
the following condition holds: 

(13) £ - <Kg) - w| < 

An M. d quasimorphism is a map O : G — > M. d which satisfies the inequality ( |i~3| ) 
(where |»| now denotes some Banach norm) - a map is an IR d quasimorphism if 
and only if its coordinates are garden-variety quasimorphisms. Such a O is called 
nondegenerate if its image is not contained in a proper hyperplane. 
An immediate corollary of Theorem 9.1 is the following Corollary: 

Corollary 9.2. If a subset Sofa countable group G has bounded image under a nonde- 
generate IR" '-quasimorphism G h-> IR d , then for every nondegenerate probability measure 
p. on G there exists a constant C = C(p), such that for each k e N we have 

p.* k (S) < Ck~ d/2 . 

The other ingredient is the result of M. Bestvina and K. Fujiwara [4], which states 
that if H is a non-elementary subgroup of the mapping class group of a surface, 
than there are infinitely many linearly independent quasimorphisms which all 
map the non-pseudo-anosov mapping classes to 0. 

The Bestvina-Fujiwara Theorem and Corollary 9.2 together show that the proba- 
bility of being non-pseudo-Anosov decreases faster than any polynomial (but does 
not quite show exponential decay. All constants in the argument are completely 
ineffective). 
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